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3 <110> APPLICANT: Tischer, Wilhelm 

y 4 Ihlenfeldt, Hans-Georg 

5 Barzu, Octavian 

6 Sakamoto, Hiroshi 

7 Pistotnik, Elisabeth 

8 Marliere, Philippe 

9 Pochet, Sylvie 

12 <120> TITLE OF INVENTION: Enzymatic synthesis of deoxyribonucleosides 

14 <130> FILE REFERENCE: 20373PWO Deoxyribonucleosides 

C--> 16 <140> CURRENT APPLICATION NUMBER: US/10/049,750 

17 <141> CURRENT FILING DATE: 2002-02-15 

19 <150> PRIOR APPLICATION NUMBER: EP99116425.2 

20 <151> PRIOR FILING DATE: 1999-08-20 
22 <160> NUMBER OF SEQ ID NOS : 20 

. 24 <170> SOFTWARE: Patentln Ver . 2.1 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 1323 
a • 28 <212> TYPE: DNA 

29 <213> ORGANISM: Escherichia coli 

31 <220> FEATURE: 

32 <221> NAME/KEY: CDS 

* 33 <222> LOCATION: (1)..(1320) 
37 <400> SEQUENCE: 1 



38 


ttg 


ttt 


etc 


gca 


caa 


gaa 


att 


att 


cgt 


aaa 


aaa 


cgt 


gat 


ggt 


cat 


gcg 


48 


39 


Leu 


Phe 


Leu 


Ala 


Gin 


Glu 


He 


He 


Arg 


Lys 


Lys 


Arg 


Asp 


Gly 


His 


Ala 




40 


1 








5 










10 










15 






42 


ctg 


age 


gat 


gaa 


gaa 


att 


cgt 


ttc 


ttt 


ate 


aac 


ggt 


att 


cgc 


gac 


aac 


96 


43 


Leu 


Ser 


Asp 


Glu 


Glu 


He 


Arg 


Phe 


Phe 


He 


Asn 


Gly 


He 


Arg 


Asp 


Asn 




44 








20 










25 










30 








46 


act 


ate 


tec 


gaa 


ggg 


cag 


att 


gec 


gec 


etc 


gcg 


atg 


acc 


att 


ttc 


ttc 


144 


47 


Thr 


He 


Ser 


Glu 


Gly 


Gin 


He 


Ala 


Ala 


Leu 


Ala 


Met 


Thr 


He 


Phe 


Phe 




48 






35 










40 










45 










51 


cac 


gat 


atg 


aca 


atg 


cct 


gag 


cgt 


gtc 


teg 


ctg 


acc 


atg 


gcg 


atg 


cga 


192 


52 


His 


Asp 


Met 


Thr 


Met 


Pro 


Glu 


Arg 


Val 


Ser 


Leu 


Thr 


Met 


Ala 


Met 


Arg 




53 




50 










55 










60 












55 


gat 


tea 


gga 


acc 


gtt 


etc 


gac 


tgg 


aaa 


age 


ctg 


cat 


ctg 


aat 


ggc 


ccg 


240 


56 


Asp 


Ser 


Gly 


Thr 


Val 


Leu 


Asp 


Trp 


Lys 


Ser 


Leu 


His 


Leu 


Asn 


Gly 


Pro 




57 


65 










70 










75 










80 




59 


att 


gtt 


gat 


aaa 


cac 


tec 


acc 


ggt 


ggc 


gtc 


ggc 


gat 


gtg 


act 


teg 


ctg 


288 


60 


He 


Val 


Asp 


Lys 


His 


Ser 


Thr 


Gly 


Gly 


Val 


Gly 


Asp 


Val 


Thr 


Ser 


Leu 




61 










85 










90 










95 






63 


atg 


ttg 


ggg 


ccg 


atg 


gtc 


gca 


gec 


tgc 


ggc 


ggc 


tat 


att 


ccg 


atg 


ate 


336 


64 


Met 


Leu 


Gly 


Pro 


Met 


Val 


Ala 


Ala 


Cys 


Gly 


Gly 


Tyr 


He 


Pro 


Met 


He 





file://C:\Crf3\Outhold\VsrJ049750.htm 



3/1/02 



Page 2 of 7 



RAW SEQUENCE LISTING DATE : 03/01/2002 

PATENT APPLICATION: US/10/049 f 750 TIME: 15:01:29 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\03012002\J049750.raw 



65 








100 










105 










110 








67 


tct 


ggt 


cgc 


ggc 


etc 


ggt 


cat 


act 


ggc 


ggt 


acg 


etc 


gac 


aaa 


ctg 


gaa 


384 


68 


Ser 


Gly 


Arg 


Gly 


Leu 


Gly His 


Thr 


Gly 


Gly 


Thr 


Leu 


Asp 


Lys 


Leu 


Glu 




69 






115 










120 










125 










72 


tec 


ate 


cct 


ggc 


ttc 


gac 


att 


ttc 


ccg 


gat 


gac 


aac 


cgt 


ttc 


cgc 


gaa 


432 


73 


Ser 


He 


Pro 


Gly 


Phe 


Asp 


He 


Phe 


Pro 


Asp 


Asp 


Asn 


Arg 


Phe 


Arg 


Glu 




74 




130 










135 










140 












76 


att 


att 


aaa 


gac 


gtc 


ggc 


gtg 


gcg 


att 


ate 


ggt 


cag 


acc 


agt 


tea 


ctg 


480 


77 


He 


He 


Lys 


Asp 


val 


Gly Val 


Ala 


He 


He 


Gly 


Gin 


Thr 


Ser 


Ser 


Leu 




78 


145 








150 










155 










160 




80 


get 


ccg 


get 


gat 


aaa 


cgt 


ttc 


tac 


gcg 


acc 


cgt 


gat 


att 


acc 


gca 


acc 


528 


81 


Ala 


Pro 


Ala 


Asp 


Lys 


Arg 


Phe 


Tyr 


Ala 


Thr 


Arg 


Asp 


He 


Thr 


Ala 


Thr 




82 






* 




165 










170 










175 






84 


gtg 


gac 


tec 


ate 


ccg 


ctg 


ate 


acc 


gee 


tct 


att 


ctg 


gcg 


aag 


aaa 


ctt 


576 


' 85 


Val 


Asp 


Ser 


He 


Pro 


Leu 


He 


Thr 


Ala 


Ser 


He 


Leu 


Ala 


Lys 


Lys 


Leu 




86 








180 










185 










190 






88 


gcg 


gaa 


ggt 


ctg 


gac 


gcg 


ctg 


gtg 


atg 


gac 


gtg 


aaa 


gtg 


ggt 


age 


ggc 


624 


89 


Ala 


Glu 


Gly 


Leu 


A sp 


Ala 


Leu 


Val 


Met 


Asp 


Val 


Lys 


Val 


Gly 


Ser Gly 




90 






195 










200 










205 










92 


gcg 


ttt 


atg 


ccg 


acc 


tac 


gaa 


etc 


tct 


gaa 


gee 


ctt 


gec 


gaa 


gcg 


att 


672 


93 


Ala 


Phe 


Met 


Pro 


Thr 


Tyr 


Glu 


Leu 


Ser 


Glu 


Ala 


Leu 


Ala 


Glu 


Ala 


lie 




94 




210 










215 










220 












96 


gtt 


ggc 


gtg 


get 


aac 


ggc 


get 


ggc 


gtg 


cgc 


acc 


acc 


gcg 


ctg 


etc 


acc 


720 


, 97 


Val 


Gly 


Val 


Ala 


Asn 


Gly Ala Gly 


Val 


Arg 


Thr 


Thr 


Ala 


Leu 


Leu 


Thr 




98 


225 










230 










235 










240 





100 


gac 


atg 


aat 


cag 


gta 


ctg 


gec 


tec 


agt 


gca 


ggt 


aac 


gcg 


gtt 


gaa 


gtt 


768 


101 


Asp 


Met 


Asn 


Gin 


Val 


Leu 


Ala 


Ser 


Ser 


Ala 


Gly Asn 


Ala 


Val 


Glu 


Val 




102 










245 










250 










255 






106 


cgt 


gaa 


gcg 


gtg 


cag 


ttc 


ctg 


acg 


ggt 


gaa 


tat 


cgt 


aac 


ccg 


cgt 


ctg 


816 


107 


Arg 


Glu 


Ala 


Val 


Gin 


Phe 


Leu 


Thr 


Gly Glu 


Tyr 


Arg 


Asn 


Pro 


Arg 


Leu 




108 








260 










265 










270 








110 


ttt 


gat 


gtc 


acg 


atg 


gcg 


ctg 


tgc 


gtg 


gag 


atg 


ctg 


ate 


tec 


ggc 


aaa 


864 


111 


Phe 


Asp 


Val 


Thr 


Met 


Ala 


Leu 


Cys 


Val 


Glu 


Met 


Leu 


He 


Ser Gly Lys 




112 






275 










280 










285 










114 


ctg 


gcg 


aaa 


gat 


gac 


gee 


gaa 


gcg 


cgc 


gcg 


aaa 


ttg 


cag 


gcg 


gtg 


ctg 


912 


115 


Leu 


Ala 


Lys 


Asp 


Asp 


Ala 


Glu 


Ala 


Arg 


Ala 


Lys 


Leu 


Gin 


Ala 


Val 


Leu 




116 




290 










295 










300 












118 


gac 


aac 


ggt 


aaa 


gcg 


gca 


gaa 


gtc 


ttt 


ggt 


cgt 


atg 


gta 


gcg 


gca 


caa 


960 


119 


Asp 


Asn 


Gly 


Lys 


Ala 


Ala 


Glu 


Val 


Phe Gly Arg Met 


Val 


Ala 


Ala 


Gin 




120 


305 










310 










315 










320 




122 


aaa 


ggc 


ccg 


acc 


gac 


ttc 


gtt 


gag 


aac 


tac 


gcg 


aag 


tat 


ctg 


ccg 


aca 


1008 


123 


Lys 


Gly 


Pro 


Thr 


Asp 


Phe 


Val 


Glu 


Asn 


Tyr 


Ala 


Lys 


Tyr 


Leu 


Pro 


Thr 




124 










325 










330 










335 






126 


gcg 


atg 


ctg 


acg 


aaa 


gca 


gtc 


tat 


get 


gat 


acc 


gaa 


ggt 


ttt 


gtc 


agt 


1056 


127 


Ala 


Met 


Leu 


Thr 


Lys 


Ala 


Val 


Tyr 


Ala 


Asp 


Thr 


Glu 


Gly 


Phe 


Val 


Ser 




128 








340 










345 










350 








130 


gaa 


atg 


gat 


acc 


cgc 


gcg 


ctg 


ggg 


atg 


gca 


gtg 


gtt 


gca 


atg 


ggc 


ggc 


1104 


131 


Glu 


Met 


Asp 


Thr 


Arg 


Ala 


Leu 


Gly 


Met 


Ala 


Val 


Val 


Ala Met Gly Gly 




132 






355 










360 










365 











file://C:\Crf3\Outhold\VsrJ049750.htm 



3/1/02 



Page 3 of 7 



RAW SEQUENCE LISTING DATE : 03/01/2002 

PATENT APPLICATION: US/10/04 9 , 750 TIME: 15:01:29 



Input Set : A:\EP.txt 

Output Set: N:\CRF3\03012002\J049750.raw 



134 


gga 


cgc 


cgt 


cag 


gca 


tct 


gac 


acc 


ate 


gat 


tac 


age 


gtc 


ggc 


ttt 


act 


1152 


135 Gly 


Arg 


Arg 


Gin 


Ala 


Ser 


Asp 


Thr 


He 


Asp 


Tyr 


Ser 


Val 


Gly 


Phe 


Thr 




136 




370 




- 






375 










380 












140 


gat 


atg 


gcg 


cgt 


ctg 


ggc 


gac 


cag 


gta 


gac 


ggt 


cag 


cgt 


ccg 


ctg 


gcg 


1200 


141 


Asp 


Met 


Ala 


Arg 


Leu 


Gly 


Asp 


Gin 


Val 


Asp 


Gly Gin Arg 


Pro 


Leu 


Ala 




142 


385 










390 










395 










400 




144 


gtt 


ate 


cac 


gcg 


aaa 


gac 


gaa 


aac 


aac 


tgg 


cag 


gaa 


gcg 


gcg 


aaa 


gcg 


1248 


145 


Val 


He 


His 


Ala 


Lys 


Asp 


Glu 


Asn 


Asn 


Trp 


Gin 


Glu 


Ala 


Ala 


Lys 


Ala 




146 










405 










410 










415 






148 


gtg 


aaa 


gcg 


gca 


att 


aaa 


ctt 


gee 


gat 


aaa 


gca 


ccg 


gaa 


age 


aca 


cca 


1296 


149 


Val 


Lys 


Ala 


Ala 


He 


Lys 


Leu 


Ala 


Asp 


Lys 


Ala 


Pro 


Glu 


Ser 


Thr 


Pro 




150 








420 










425 










430 








152 


act 


gtc 


tat 


cgc 


cgt 


ate 


age 


gaa 


taa 
















1323 


153 


Thr 


Val 


Tyr 


Arg 


Arg 


He 


Ser 


Glu 




















154 






435 










440 





















157 <210> SEQ ID NO: 2 

158 <211> LENGTH: 440 

159 <212> TYPE: PRT 

160 <213> ORGANISM: Escherichia coli 
162 <400> SEQUENCE: 2 
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L . 251 435 440 

255 <210> SEQ ID NO: 3 

256 <211> LENGTH: 720 

257 <212> TYPE: DNA 

258 <213> ORGANISM: Escherichia coli 
260 <220> FEATURE: 

2 61 <221> NAME/KEY: CDS 

262 <222> LOCATION: (1)..(717) 

264 <400> SEQUENCE: 3 
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285 65 70 75 80 

287 ttc ggc gtg aag aaa att ate cgc gtg ggt tec tgt ggc gca gtt ctg 288 

288 Phe Gly Val Lys Lys He He Arg Val Gly Ser Cys Gly Ala Val Leu 

289 85 90 95 

291 ccg cac gta aaa ctg cgc gac gtc gtt ate ggt atg ggt gee tgc acc 336 

292 Pro His Val Lys Leu Arg Asp Val Val He Gly Met Gly Ala Cys Thr 

293 100 105 "* 110 

295 gat tec aaa gtt aac cgc ate cgt ttt aaa gac cat gac ttt gee get 384 

296 Asp Ser Lys Val Asn Arg He Arg Phe Lys Asp His Asp Phe Ala Ala 

297 115 120 125 

299 ate get gac ttc gac atg gtg cgt aac gca gta gat gca get aaa gca 432 

300 He Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 

301 130 135 140 

303 ctg ggt att gat get cgc gtg ggt aac ctg ttc tec get gac ctg ttc 480 

304 Leu Gly He Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 

305 145 150 155 160 

307 tac tct ccg gac ggc gaa atg ttc gac gtg atg gaa aaa tac ggc att 528 

308 Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly He 

309 165 170 ^ 175 

313 etc ggc gtg gaa atg gaa gcg get ggt ate tac ggc gtc get gca gaa 576 

314 Leu Gly Val Glu Met Glu Ala Ala Gly He Tyr Gly Val Ala Ala Glu 

315 180 185 190 

317 ttt ggc gcg aaa gee ctg acc ate tgc acc gta tct gac cac ate cgc 624 

318 Phe Gly Ala Lys Ala Leu Thr He Cys Thr Val Ser Asp His He Arg 

319 195 200 205 

321 act cac gag cag acc act gee get gag cgt cag act acc ttc aac gac 672 

322 Thr His Glu Gin Thr Thr Ala Ala Glu Arg Gin Thr Thr Phe Asn Asp 

323 210 215 220 

325 atg ate aaa ate gca ctg gaa tec gtt ctg ctg ggc gat aaa gag taa 720 

32 6 Met He Lys He Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu 
327 225 230 235 

330 <210> SEQ ID NO: 4 

331 <211> LENGTH: 239 

332 <212> TYPE: PRT 

333 <213> ORGANISM: Escherichia coli 
335 <400> SEQUENCE: 4 

33 6 Met Ala Thr Pro His He Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
337 15 10 15 

339 Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr He Ala Glu Thr 

340 20 25 ^ 30 

342 Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 

343 35 40 45 

34 7 Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
34 8 50 55 60 

350 Met Gly He Pro Ser Cys Ser He Tyr Thr Lys Glu Leu He Thr Asp 

351 65 70 75 80 

353 Phe Gly Val Lys Lys He lie Arg Val Gly Ser Cys Gly Ala Val Leu 

354 85 90 95 
356 Pro His Val Lys Leu Arg Asp Val Val He Gly Met Gly Ala Cys Thr 
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